New findings on the use of static code attributes for defect prediction Muhammed

نویسندگان

  • Muhammed Maruf Öztürk
  • Ahmet Zengin
چکیده

Defect prediction includes tasks that are based on methods gener ated using software fault data sets and requires much effort to be completed. In defect prediction, although there are methods to conduct an analysis involving the classification of data sets and localisation of defects, those methods are not sufficient without eliminating repeated data points. The NASA Metrics Data Program (Nasa MDP) and Software Research Laboratory (SOFTLAP) data sets are frequently used in this field. Here, we present a novel method developed on the Nasa MDP and SOFTLAB data sets that detects repeated data points and analyses low level metrics. Also, a framework and an algorithm are presented for the proposed method. Statistical methods have been used for detecting repeated data points. This work sheds new lights on the extent to which repeated data adversely affects defect prediction performance, and stresses the importance of using low level metrics.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comments on "Data Mining Static Code Attributes to Learn Defect Predictors"

In this correspondence, we point out a discrepancy in a recent paper, “Data Mining Static Code Attributes to Learn Defect Predictors,” that was published in this journal. Because of the small percentage of defective modules, using Probability of Detection (pd) and Probability of False Alarm (pf) as accuracy measures may lead to impractical prediction models.

متن کامل

Combining Particle Swarm Optimization based Feature Selection and Bagging Technique for Software Defect Prediction

The costs of finding and correcting software defects have been the most expensive activity in software development. The accurate prediction of defect‐prone software modules can help the software testing effort, reduce costs, and improve the software testing process by focusing on fault-prone module. Recently, static code attributes are used as defect predictors in software defect prediction res...

متن کامل

Cross- vs Within-Company Defect Prediction Studies

In a recent May 2007 IEEE TSE article, Kitchenham et.al. explored effort estimation and found contradictory evidence about the value of crossvs within-company data. Those contradictory results may have been the result of effort estimation features, some of which are subjective in nature. Static code features are different than effort estimation features. They can be generated in an automatic, r...

متن کامل

Extracting software static defect models using data mining

Defect models; Software testing; Software metrics; Defect prediction Abstract Large software projects are subject to quality risks of having defective modules that will cause failures during the software execution. Several software repositories contain source code of large projects that are composed of many modules. These software repositories include data for the software metrics of these modu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016